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Abstract 

w-powers of finitary languages are w-languages in the form V^, where 
V is a finitary language over a finite alphabet S. Since the set of 
infinite words over £ can be equipped with the usual Cantor topology, 
the question of the topological complexity of cj-powers naturally arises 
and has been raised by Niwinski [Niw90j, by Simonnet [Sim92j, and 
by Staiger [Sta97b] . It has been proved in [FinOlj that for each integer 
n > 1, there exist some w-powers of context free languages which are 
n^-complete Borel sets, and in [Fin03| that there exists a context free 
language L such that L w is analytic but not Borel. But the question 
was still open whether there exists a finitary language V such that 
is a Borel set of infinite rank. 

We answer this question in this paper, giving an example of a finitary 
language whose cu-power is Borel of infinite rank. 

Keywords: Infinite words; w-languages; w-powers; Cantor topology; topological 
complexity; Borel sets; infinite rank. 



1 Introduction 



w-powers are cu-languages in the form V u , where V is a finitary language. 
The operation V — > V w is a fundamental operation over finitary languages 
leading to u;-languages. This operation appears in the characterization of 
the class REG^ of a>regular languages (respectively, of the class CF^ of 
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context free ^-languages) as the w-Kleene closure of the family REG of reg- 
ular finitary languages (respectively, of the family CF of context free finitary 
languages) [Sta97a] . 

The set S w of infinite words over a finite alphabet £ is usually equipped with 
the Cantor topology which may be defined by a distance, see [Sta97aJ [PP04] . 
One can then study the complexity of cu-languages, i.e. languages of infinite 
words, by considering their topological complexity, with regard to the Borel 
hierarchy (and beyond to the projective hierarchy) [Sta97a] [PP04j . 
The question of the topological complexity of tu-powers of finitary languages 
naturally arises. It has been posed by Niwinski [Niw90] , by Simonnet |Sim92j 
and by Staiger |Sta97b] . The cu-power of a finitary language is always an an- 
alytic set because it is the continuous image of a compact set {0, 1, ... , n} w 
for n > or of the Baire space u^, |Sim92j |Fin01j . It has been proved in 
|Fin01] that for each integer n > 1, there exist some empowers of context free 
languages which are n°-complete Borel sets, and in |Fin03j that there exists 
a context free language L such that L u is analytic but not Borel. 
But the question was still open whether there exists a finitary language V 
such that V u is a Borel set of infinite rank. 

We answer this question in this paper, giving an example of a finitary lan- 
guage whose tu-power is Borel of infinite rank. 

The paper is organized as follows. In Section 2 we recall definitions of Borel 
sets and previous results and we proved our main result in Section 3. 

2 Recall on Borel sets and previous results 

We assume the reader to be familiar with the theory of formal cu-languages 
|Tho90] . [Sta97aj . We shall use usual notations of formal language theory. 

When E is a finite alphabet, a non-empty finite word over S is any sequence 
x = a± . . . 0^ , where a; 6 S for % = 1, . . . , k , and k is an integer > 1. 
The length of x is k, denoted by \x\. The empty word has no letter and is 
denoted by A; its length is 0. For x = a%. . .a^, we write x{i) = cii and 
x[i] = x(l)...x(i) for i < k and x[0] = A. X* is the set of finite words 
(including the empty word) over S. 

The first infinite ordinal is u. An u- word over S is an u -sequence a\ . . . a n . . ., 
where for all integers i > 1, a; 6 E. When a is an w-word over S, 
we write a = cr(l)a"(2) . . . a{n) . . ., where for all i, a{i) G S, and a[n] = 
a(l)a(2) . . . a(n) for all n > 1 and a[0] = A. 
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The prefix relation is denoted Q a finite word u is a prefix of a finite word u 
(respectively, an infinite word v), denoted u □ t> , if and only if there exists a 
finite word (respectively, an infinite word w), such that v = u.w. The set 
of uj- words over the alphabet E is denoted by An u- language over an 
alphabet X is a subset of E w . 

For V C £*, the uj-power of V is the u;-language: 

V" = {a = Ul . . . w n . . . G | Vz > 1 Ui G V - {A}} 

We assume the reader to be familiar with basic notions of topology which may 
be found in [Mos80] [LT94] |Kec95j [Sta97aj [FFolj . For a finite alphabet X, 
we consider X w as a topological space with the Cantor topology The open 
sets of X u are the sets of the form W.X W , where W C X*. A set L C A w is 
a closed set iff its complement X w — L is an open set. Define now the Borel 
Hierarchy of subsets of X u : 

Definition 2.1 For a non-null countable ordinal a, the classes S° and DI° 
of the Borel Hierarchy on the topological space X^ are defined as follows: 
S° is the class of open subsets of X w . 
TL® is the class of closed subsets of X u . 
and for any countable ordinal a > 2: 

S° is the class of countable unions of subsets of X w in U 7<Q DI°. 

DI° is the class of countable intersections of subsets of X w in U 7<Q S°. 

For a countable ordinal a, a subset of X 1 ^ is a Borel set of rank a iff it is in 
E° U n° but not in U 7 <a( S ° u n °)- 

There are also some subsets of X u which are not Borel. In particular the 
class of Borel subsets of X u is strictly included into the class X* of analytic 
sets which are obtained by projection of Borel sets, see for example [Sta97a| 
[LT94] |PP04| [Kec95] for more details. 

We now define completeness with regard to reduction by continuous func- 
tions. For a countable ordinal a > 1, a set F C X u is said to be a S° 
(respectively, DI°, Sj)- complete set iff for any set E C Y w (with Y a finite 
alphabet): E G S° (respectively, E G 11°, E G Sj) iff there exists a con- 
tinuous function / : Y w — > A w such that = f~ 1 (F). S° (respectively 
n°J -complete sets, with n an integer > 1, are thoroughly characterized in 
[Sta86j . 
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In particular 1Z = (0*.1)' J is a well known example of Il^-complete subset of 
{0, 1}". It is the set of cj-words over {0, 1} having infinitely many occurrences 
of the letter 1. Its complement {0, 1} W — (0*.1)" J is a S^-complete subset of 
{0,1}-. 

We shall recall the definition of the operation A — > A~ over sets of infinite 
words we introduced in [FinOlj and which is a simple variant of Duparc's 
operation of exponentiation A — > A~ |Dup01| . 

For a finite alphabet £ we denote = S u U E*. Let now «- a letter not 
in £ and J = SU {^}- For a; G X-^, x^~ denotes the string x, once every 
«- occuring in x has been "evaluated" to the back space operation ( the one 
familiar to your computer!), proceeding from left to right inside x. In other 
words x^~ = x from which every interval of the form "a «- " (a G £) is 
removed. We add the convention that (u. «-)*~ is undefined if \u*~\ = 0, i.e. 
when the last letter «- can not be used as an eraser (because every letter of 
£ in u has already been erased by some erasers «- placed in u). Remark that 
the resulting word x^ may be finite or infinite. 

For example if u — (a «-) n , for n > 1, or u — (a then = A, 

if u = (ab then {u)*~ = a w , if « = oo(<^ a) w then («)*" = 6, 

if u (a or u = a <^<^- a? or u = (a <^<^) w then (u)*~ is undefined. 

Definition 2.2 For A C £ w , A~ = {x G (£ U | G A}. 

The following result follows easily from |Dup01| and was applied in [FinOlj 
to study the u;-powers of finitary context free languages. 

Theorem 2.3 Let n be an integer > 2 and ACE" be a complete set. 
Then A~ is a n° +1 - complete subset of (£ U { < ^}) UJ ■ 

For each tu-language ACE", the w-language A~ can be easily described 
from A by the use of the notion of substitution which we recall now. 
A substitution is defined by a mapping / : £ — > P(T*), where £ = {ai, . . . , a n } 
and T are two finite alphabets, / : — > Li where for all integers i G [1; n], 
f(a,i) = Li is a finitary language over the alphabet T. 

Now this mapping is extended in the usual manner to finite words: /(a^ ■ ■ • di„) = 
L ix . . .L in , and to finitary languages L C £*: f(L) = U xe if(x). If for each 
integer i G [1; n] the language Lj does not contain the empty word, then the 
mapping / may be extended to c<>words: 

f(x(l) . . . x(n) . . .) = {u! . . . u n . . . | Vz > 1 Ui e f(x(i))} 
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and to cu-languages L C E^ by setting f(L) = U x( zLf(x). 

Let Li = {w G (EU-f^})* | = A}. L\ is a context free (finitary) language 
generated by the context free grammar with the following production rules: 
S — > aS «- S with a G E; and S 1 — > A (where A is the empty word). 
Then, for each u;-language A C the a>language A~ C (E U {<^}) aJ is 
obtained by substituting in A the language L x .a for each letter a G E. 

By definition the operation A — > A~ conserves the w-powers of finitary lan- 
guages. Indeed if A = V w for some language l^CS* then A~ = g(V u ) = 
(g{V)) w where g : E — > P((E U {^})*) is the substitution defined by 
g(a) = Li.a for every letter a G E. 

3 An empower which is Borel of infinite rank 

We can now iterate k times this operation A — > A". More precisely, we 
define, for a set A C E w : 

4 J = WT, ^d 

where we apply k times the operation A — > A~ with different new letters 
«-/c-i, • • • ,«s, «~2, m such a way that we have successively: 

Af° = A C E w , 

AfC(SU{n,n-i}r, 

A^C(EuKn-i,..,-i}) u 

For a reason which will be clear later we have chosen to successively call the 
erasers «-fc_i, . . . , «-2, in this precise order. We set now A~- h = A^ M 
so it holds that 

^c(suKn-i,.,n}r 

Notice that definitions of A"' 1 , A^' 2 , . . . , A^ k ~^ were just some intermediate 
steps for the definition of A~- k and will not be used later. 

We can also describe the operation A — > A~ fc in a similar manner as in the 
case of the operation A — > by the use of the notion of substitution. 
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Let L k C (E U • • • , be the language containing (finite) 

words w, such that all letters of u have been erased when the operations of 
erasing using the erasers «-2, • • • , «~k-h «~k, are successively applied to 
u. 

Notice that the operations of erasing have to be done in a good order: the 
first operation of erasing uses the eraser then the second one uses the 
eraser «- 2 , and so on . . . 

So an eraser may only cruise db letter of S or an other "eraser" for 
some integer i > j. 

It is easy to see that L k is a context free language. In fact L k belongs to the 
subclass of iterated counter languages which is the closure under substitution 
of the class of one counter languages, see [ABB96J [Find J for more details. 

Let now h k be the substitution: E — > U {<^- k , «-&-]., • • • , defined 
by h k (a) = L k .a for every letter a G E. 

Then it holds that, for A C E w , A~- k = h k (A), i.e. A~- k is obtained by 
substituting in A the language L k .a for each letter a e E. 

The ^-language ^ = (0M) W = V w , where V = (OM), is n^-complete. 
Then, by Theorem 12.31 for each integer p > 1, ^(V 4 ') = (/i P (V))^ is a np +2 - 
complete set. 

We can see that the languages L k , for k > 1, form a sequence which is strictly 
increasing for the inclusion relation: 

LiCL 2 cL 3 C...CLiC L i+ i . . . 

In order to construct some a>-power which is Borel of infinite rank, we could 
try to substitute the language U k >\L k .a to each letter a G E. But the 
language U k >\L k .a is defined over the infinite alphabet EU{«-i, «- 2) • • •}? 
so we shall first code every eraser by a finite word over a fixed finite 
alphabet. The eraser «-j will be coded by the finite word a.fl 3 .at over the 
alphabet {a,j3}, where a and (3 are two new letters. 

The morphism tp p : (EU{«-i, . . . , ^ p })* — > (SUja, f3})* defined by <p p (c) = c 
for each c G E and ip p («-j) = a -ft .a. for each integer j G [l,p], can be 
naturally extended to a continuous function ip p : (E U . . . , ^ P }) w — > 
(SU{a,/3}) w . 
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Let now 

n>l 

and /i : £ — > P((E U {a,/?})*) be the substitution denned by 

h(a) = C.a 

for each a e E. 

We can now state our main result: 

Theorem 3.1 Let V = (OM). T/ien toe empower (fc(V)) w C {0,l,a,/3} w «s 
a 5ore/ set of infinite rank. 

To prove this theorem, we shall proceed by successive lemmas. 

Lemma 3.2 For all integers p > 1, the u-language ip p (TZ R1 ' p ) is a II p+2 - 
complete subset of (E U {a, I3}Y . 

Proof. First we prove that ip p (lZ R1 - p ) is in the class II p+2 . 

For E = {0, 1}, ip p ((^ U {«-i, . . . , ^ P }) w ) is the continuous image by ip p of 

the compact set (E U • • • , ^-p})^, hence it is also a compact set. 

The function ift p is injective and continuous thus it induces an homeomor- 

phism ifj' p between the two compact sets (Eu{«-i, . . . , «- p }Y and -0 p ((EU{«-i 

We have already seen that, for each integer p > 1, the w-language 1Z~ /p is 
a n° +2 -complete subset of (E U {<«-!, . . . , ^ p }) w . Then ^(^~- p ) is a n p+2 - 
subset of ^((EU , ^p}) 1 ^), because the function ip' is an homeomor- 

phism. 

But one can prove, by induction over the integer j> 1, that each IT? subset 
K of ip p ((H U . . . , ^ P }) w ) is also a IT? subset of (E U {a,/3}) w '. Thus 
f p (n^- p ) = 4> p (n~ p ) is a n° +2 -subset of (E U {a,/?}) w . 
Remark now that the set 7£~ ,p being II p+2 -complete, every LI p+2 -subset of 
X w , for X a finite alphabet, is the inverse image of 7Z~' P by a continuous 
function. But it holds that 1Z~ /p = i/j~ 1 (iJjp(Tl~- p )), where ip p is a continuous 
function. Thus every II p+2 -subset of X^, for X a finite alphabet, is the 
inverse image of vf) p (7l~' p ) by a continuous function. Therefore i/j p (71~' p ) is 
also a n p+2 -complete subset of (E U {a, (3})^. □ 

Lemma 3.3 The set (h(V)Y is not a Borel set of finite rank. 
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Proof. Consider, for each integer p > 1, the regular cu-language 

R p = V P ({0, 1, «- 2j . . . , = {0, 1, a./?.a, a./3 2 .a, . . . , a./3 p .a}" 

We have seen that R p is compact hence it is a closed set. And by construction 
it holds that (h(V))" n R p = ip p (h p (V")) = 4> p (1l~ p ) where K = V w , so by 
Lemma this set is a IIp +2 -complete subset of {0, 1, a, (3}". 

If (/i(V)) w was a Borel set of finite rank it would be in the class 11° for some 
integer J > 1. But then {h(V)Y fl R p would be the intersection of a Ilj-set 
and of a closed, i.e. Il^-set. Thus, for each integer p > 1, the set (h(V)) u f)Rp 
would be a LTj-set. This would lead to a contradiction because, for J = p, a 
Ilj-set cannot be a IIp +2 -complete set. □ 

Lemma 3.4 Every u-word x G (^(V)) 1 ^ /ias a unique decomposition of the 
form x — U1.U2 ■ ■ - U n . . . where for alii > 1 G /i(V). 

Proof. Towards a contradiction assume on the contrary that some w-word 
x G (/i(V)) w = (/i(0*.l)) w has (at least) two distinct decompositions in words 
of h(V). So there are some words Uj,u'j G h(V), for j > 1, such that 

x = U\.u-i . . . u n . . . = u\.u' 2 . . . u' n . . . 

and an integer J > 1 such that Uj = u'j for j < J and uj C m'j, i.e. itj is a 
strict prefix of u'j. Then for each integer j > 1, there are integers n^, > 
such that Uj G h(0 nj .1) and G /i(0 n J.l). Thus there are some finite words 
G C, where i is an integer in [1, rij + 1], and w\ G £, where i is an integer 
in [1, n'j + 1], such that 

uj = v{.0.v 3 2 .0 . . . v J n ..0.v 3 n]+1 .l and u'j — u^.O.u^-O . . . w 3 n , fi.w 3 n , +v l 

We consider now 2 given by its first decomposition x — u\.U2 ■ ■ ■ u n . . . 
Let now x^ be the w-word obtained from x by using the (code of the) eraser 
«-i as an eraser which may erase letters 0, 1, and (codes of the) erasers <^ p 
for p > 1. Remark that by construction these operations of erasing occur 
inside the words v\ for j > 1 and i G [1,71^ + 1]. 

Next let X 1 - 2 ) be the tu-word obtained from x^ 1 ' by using the (code of the) 
eraser «- 2 as an eraser which may erase letters 0, 1, and (codes of the) 
erasers «- p for p > 2. Again these erasing operations occur inside the words 
v\. 

We can now iterate this process. Assume that, after having successively used 
the erasers ^2, ■ ■ ■ , «~n, for some integer n > 1, we have got the cu-word 
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x ( n ) from the a;-word x. We can now define x^ n+1 ^ as the w-word obtained 
from X*™) by using the (code of the) eraser ^ n+ \ as an eraser which may 
erase letters 0, 1, and (codes of the) erasers «- p for p > n + 1. 

We shall denote = min{k > 1 | v\ G (p k (L k )}. Then v\ G ip K(i]) (L K{ij) ) 

for all integers j ' > 1 and z G [1,% + 1]- Thus after Kj = max{K^^ \ i G 
[1, rij + 1]} steps all words v?, for 2 G [1, rij + 1], have been completely erased 
and, from the finite word Uj, it remains only the finite word 0^.1. 
After K J = max{Kj \ j G [1, J]} steps, from the word U1.U2--.uj, it re- 
mains the word ni .1.0 n2 .1.0 n3 .l . . . nj .l which is a strict prefix of x^ K \ In 
particular the J-th letter 1 of x^ K ) is the last letter of uj, which has not 
been erased, and it will not be erased by next erasing operations. 

Notice that the successive erasing operations are in fact applied to x inde- 
pendently of the decomposition of x in words of h(V). So consider now the 
above erasing operations applied to x given by its second decomposition. 
Let K'u-s = min{k > 1 | wj G (pk(L k )}, and K' J = max{K^^ | j G 

[1, J] and i G [1, v! j + 1]}. K> = K'i holds for 1 < j < J and K J < K >J . 
We see that, after K' 3 steps, the word ni .1.0" 2 .1.0 n3 .l . . . nj - 1 .1.0 n J.l is a 
strict prefix of x^ K ^ . The J-th letter 1 of x^ K is the last letter of u'j (which 
has not been erased). But we have seen above that it is also the last letter 
of uj (which has not been erased). 

Thus we would have uj = u'j and this leads to a contradiction. □ 

Remark that in terms of code theory Lemma 13.41 states that the language 
h(V) is an a>code. 

Lemma 3.5 The set (hiV)) 1 ^ is a Borel set. 

Proof. Assume on the contrary that (^(V))^ is an analytic but non Borel set. 

Recall that lemma 4.1 of |FS03] states that if X and Y are finite alphabets 

having at least two letters and B is a Borel subset of x Y u such that 

PROJx^(B) = {cr G X w I 3u (a,u) G B} is not Borel, then there are 2 K ° 

w-words a G X u such that B a = {v G \ (cr, v) G B} has cardinality 2 K ° 

(where 2 K ° is the cardinal of the continuum). 

We can now reason as in the proof of Fact 4.5 in [FS03J. 

Let 6 be a recursive enumeration of the set h(V). The function 9 : N — > h(V) 

is a bijection and we denote Ui = 0{i). 

Let now V be the set of pairs (a, v) G {0, 1} W x {0, 1, a, (3}^ such that: 
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1. a G (0*.l)^, so a may be written in the form 

a = ni .1.0 n2 .1.0" 3 .l . . .0 np .1.0 np+1 .l . . . 
where Vi > 1 rii > 0, and 

2. 

^ ^ni '^712 - ^7i3 • • • ^rip-^rtp+i • • • 

D is a Borel subset of {0,1}^ x {0,l,a,/3}' J because it is accepted by a 
deterministic Turing machine with a Biichi acceptance condition [Sta97aJ. 
But PRO «7{Q ) i,a, j 3}w(X > ) = (/i(V)) w would be a non Borel set thus there would 
be 2 N ° w-words i/ in (/i(V)) 1 ^ such that V v has cardinality 2 K °. This means 
that there would exist 2 K ° w-words v G (hiV))^ having 2 K ° decompositions 
in words in h(V). 

This would lead to a (strong) contradiction with Lemma 13.41 □ 
Theorem 13.11 follows now directly from Lemmas 13.31 and 13.51 

4 Concluding Remarks 

We already knew that there are w-powers of every finite Borel rank |Fin01j . 
We have proved that there exists some w-powers of infinite Borel rank. The 
language h(V) is very simple to describe. It is obtained by substituting in 
the regular language V = (0M) the language C.a to each letter a G {0, 1}, 
where C = U n >i Vn(-^n)- Notice that the language C is not context free but 
it is the union of the increasing sequence of context free languages (p n (L n ). 
Then £ is a very simple recursive language and so is h(V). 
The question is left open whether there is a context free language W such 
that W w is Borel of infinite rank. 

The question also naturally arises to know what are all the possible infinite 
Borel ranks of tu-powers of finitary languages or of finitary languages belong- 
ing to some natural class like the class of context free languages (respectively, 
languages accepted by stack automata, recursive languages, recursively enu- 
merable languages, . . . ). 

Acknowledgements. We thank the anonymous referee for useful comments 
on a preliminary version of this paper. 
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